Llama 3 1 Nemotron Ultra 253B V1
Other
A large language model derived from Meta Llama-3.1-405B-Instruct, optimized through neural architecture search technology, supporting 128K tokens context length, suitable for reasoning, dialogue, and instruction-following tasks.
Large Language Model
Transformers English